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This paper describes a large computer-coded 
conversational speech data base andiresults of testing phonological 
rules on that data base. The study chows that frequency of rule 
.application depend^s not only on phonological environment's, but also 
on frequency of ' occurrence of specific words. That is, some rules .are 
highly word-dependent, others are more phonoltjgically governed. This 
relationship betWeen rule "application and word frequency suggests 
that differemt kinds of phonological variation should be represented 
in different ways in the lexicon and phonological rule components of 
a speech understanding system. (Author) 
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•It is clear that '^'there is considerable phonological 
variation in conversational speech , and that wordsfTnay be 
realized with various pronunciations depending on the speaker, 
the rate of speech, and the linguistic environments, such as 
surroufeiding consonants, vowels, boundaries, and stress pa-t- 
terns. . 

Muc|i qf this pronunciation variatipn is systematic/ 

'. . f , ' / ■ . ^ 

and can be ^captured, in statements of general, phonoldgicai 

rules. Thp use of phonological rulels to expand' a speech 

understai^ding system lexicon and to enhance word verifica- 

tion procedures has been discussed elsewhere, including ■ 

earlier papers in this session by Cook and ^Woods and Zue[2,3], 

One area which has received relatively little attention 

the study ofNtrequency of occurrence of rule qipplica^tion 

as reflected in natural continuous speech, and ,the estimation 

of actual rule .application probabilities.' The problem has 

been that a large tiody- of carefully transcrilmd sfjeech data, 



wllichf was, neces.sary for statistical ly^^eli^al^ results, had 
no.;t been available previously, nor" were rule-^testing mech-^ 
anisms -available which would efficiently find the total num- . 
ber of rule applxaations /in a ^given phonological 'environment* 
as compared with the total number of occurrences of that 
environment. ' • " « 

The purpose of this paper is 

(1) ..to descril^e such a speech data base and rule-testing 

and rule-application counting mechariisms; 

(2) ''to show„ that the frequency of rule application de- 
pend© not only on phpnological environments, but 

" also on frequency of occurrence of specific words ; 

and 

(3) to suggest how this relationship between rule ^p- 
^ plication and word frequency might be represented 

in the lexicon and phonological rule component of 
a speech understanding system. 



A '^ata base of more than 35 natural speech discourses, 
includii\(g monologues, interviews^ and man-machine protocols, 
has been \caref ully transcribed and computer-coded at SCRL 
using a qUasi-phonemic alphabet kpown as the ARPAbet . The 

transcription includes phonologicial in format id^ such as 

/• . . 

I ■ "\ 

segment insertion, deletion and^subst itu'tion , bu't does not 

include phonetic detail such as ^nasalization or aspiration. 



|-y w^s associated with its 
A minimal and phonologi- 



In addition, to this I^rge; phoqibmic 'data base", excerpts 
from several discourses, have been transcribed and computers 
coded visicig .a phonetic symbol set /which includes more de- 

ill 

^tailed information such as . nasali:|ation , aspiratioti, vov^el 
; , raising , etc . 

A subset of these two data biases was used in the work 
described in this paper. The, data were sorted into alphabetic 
lists, so that each li^xical ent:| 
various dis.course pronunciation is , 

cally reasonable set of base forrtis (i.e. input strings to 
the phopological rules)" was then edited into the entries. 
The. result resembles an expanded pronouncing lexicon - - - 
i.e. a lexical entry, base form(s)v and vario\is actual 
discourse pronunciations. This lexicon will bfe callc^d a 
discourse lejcicon . 

. The ryile-testing and rule application mechanisms were 
developed by pave Brill at SCRL, and are extensions to the 
Bobro\A-Fraser rule-tester [1] . The Bobrow-Fraser rule- 
tester, allows for definition of context-sensitive rules of 

the tyipe shown in Figure 1. 

1 \\ ■ . • ■ - 

Tf\e rule states that W becomes X ^ in the context of Y 



on the left and Z on the right, jji.e. Y W Z - > Y X Z 



case of 



The 



X = 0 (null) is an instance of deletion » W = 0 is 
an instance of . insertion , and W 7^ X 7^ 0 is an instance of 
substitiktion . ' ^ ■ . 
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The SCRL modifications allow^ the user to sibecify 
(a) a rule or set of rules p and // " 

(b.) a discourse lexicon, . // * 

and to match the pronunciations g.enerated by fule with the' 

actual pronunciations in the discourse. An jxample of the 

'output is shown in Figure 2. ^ 

Rule R1.02 consists of the ordered concatenation of three 

rules (not shown here) which 

(a) ireduce 1 stres3 to 0 stress; ' 

(b) change all, J2f stress vowels to'/s/ (AX); 

(c) de\ete word-final /d/ after /nT/. 

This composite.^ rule R102 is applied to/the base form posited 
for each lexic^ entry, and th6 resulting derived form is 
matched against '^he realized discourse forms. -Tor example, 
R102 'is applied t^ the base form fcir "and/" and produces the ^ 
derived form Am (A,E:IND AEND ^> AXND ^> AXN)^^. 

This derived form is checked against the I'S realiza- 
tions of "an4," with the result that MOT (no transf ormatijon ) 
occurred 12 times (i.e. the realized form matches the base . 
form); TRA (transformation) occurred 6 times;, (i.e. the 
realized form matches the derived form), and SOT ^( some other 
transfoirination) occurred 57 times; (i.e. the struHural condi 
tiojas for the rule are met,| but some other rule must have ap- 
plied, because neither the base form nor the derived form 
match these realized forms) . ' . 



A suiraiary list of this information is also provided, for 
the entire discourse lexicon, as ^ shown in Figure 3. For the 
lexical 'entry "and" there are 12 instances of NOT, 6 of TRA, 
and 5 7 of SOT. " , 

Using t;his rule-application counting mechanisin, large 
bodies of speech data can be anal^J^zed, - and frequency of rule 
application statist io^ can be accumulated- 

To illustrate the relationship betv/een frequency of 

rule application and the distribution of rule application with 

• ./ ■ • 

respect to specific words, a single rule was tested against 
discourse data from ^5 speakers. Figure 4 shows the rela- ^ 
tionship between, tfhe number of times a rule applies, the 
number of tim^s it can apply, and the number of words to which 
it applied. 

The rule that was tested involved deletion of word-final 
/t/ or /d/ following /n/. The speech data was made up of 
27, 239 tokens (realizations of words) which repi^esented 
3,519 types (different .words ) . Of these, 1, 925 tokens 
(129 types) contained under lying /-nt/ or /-nd/ clusters, 
i.e. the base form met the structural conditions of the del^^ 
tion rule. There were .650 instances (111 types) of no dele- 
tion, and 123 cases (30 types) in which it was not clear that 
deletion had occurred (e.g. occurrences of /?/ or degemination 
.of {^} # [\] clusters across -word boundaries). That is. 



34% of the tokens showed no deletion, 60% ^underwent deletion, 

and 6% underwent some other transformation. It should be 

' \- 

noted that these figures are independent of the specific wordns 

\ ■ ' . ■ 

which are input strings to the rules. 

figure 5 illustrates how the frequency of appTibation of 
the rule is related to speo4iS,ic words to which "^the rule^ can 
apply. There were 1146 occurrences of the lexical item "and"# 
whose^nder lying form meets the conditions of the /t,d/ dele- 
tion rule. -^^Of these, 970 occurrences showed deletion, and 
173 did not. That is, the single lexical Yfeem "and" repif^- 
sented only 4% of the total number of tokens in the data 
base, but accounted for 84% of all occurrepces of deletion. 

The deletion rule is variable in its application , as 
indicated here. On the other hand, the process of nasali- 
zation appears to occur uniformly whenever the appropriate 
context occurs, i.e. whenever a nasal^' f ollowi^ a vowel, the 
vowel is nasalized. In the phonetically transcribed dat-a from 
three speakers, 417 words (tokens) , there were 129 occurrences 
of the context for nasalization, and 12^9 occurrences of na^^Y' 
izati^n, i.^. the rule applied 100% • of the time. 

It\should be noted that it might seem obvious that lower- 
level mature implementation /such as nasalization should be 
more "regular," or "less optional," but that does not always 



3eem to be true either.) Certain low-level feature imple- 
mentation rules such, as stop insertion between nasals and 
fricatives (e.g. "somethin.g" realized with inserted /p/f 
or "length" realized wibh inserted^/k/) do not occur with such 
regularity. 

These two types of riile application suggest, that when 
applying the notion of "probability of rule application" to 
Automatic speech recognition problems, two measures must be 
considered, both of which can be obtained using the 
data base and rule-testing mechanisms just described. 

One measure is an estimate of probability of application 
of a rule ^jiven that the Unguis tic ^ aonte:^t required by the^ 
rula^ occurred. In actual system development, a large body ^ 
of data based on the system lexicon and protocols, and po- 
tepxial system users , could be processed to giVe^^^^^^accurate 
s)^^,st em- specific figures 

1^ The second measure is an estimate of the probability 
of application of a rule^ (jiViin that a particular word 
( satis fiji'ng the ling^uistic context required by the rule) . 
occurred, ./ . . > - 

TJie difference ^between thes^ two measures was seen in 

Figures 4 and 5. The probability that /t,d/ deletion would 

J. * . 

occur given that the context occurred, is 1152/1925 = .6. 
The .probability that deletion^ would occur given that the 
word "and" .6ccurred is 97G/1146 = .85. If "and" is removed 
from the data base, an estimate of .the probability that 



deletion would apply given that the context occurred would 
.be 182/779 = .23. . 

These two measures can then be evaluated to determine 
how the phonological variation described by the rules can 
be represented in a given system. ' • 

For example, if the probability of "application, given 
the appropriate co/ntext/ is high (e.g. nasalization), then" 
it can be considered a general process" which does not* have 
to be ref lec^fe^indlvidually ih each lexical item but can . 
be applied at another level, e.g. in the analysis-by-synthcsis' 
procedure described in this session by Cook. 

If application of a rule ^^^ears^ to be highly word-- 
dependent (e.g. /t,d/ deletion on "and"), then the pronun- 
ciation variant associated with that rule could be repre- 
sented directly in the lexical entry with a high likelihood,^ 
such that it would be posited first in a word . hypothesis 
and verification scheme. 

A^lthough the speech data described here have been 
carefully transcribed conversational speech, the &ame rule- 
testing and rule-appl icat ion mechanisms can be used^with 
computer transcriptions which reflect botl) phonqlogical 
variation and variation due to machine recognition error. 
Since both kinds of variation effectd-vely look the same to 
the higher levels of speech recognition systejjs, this infor- 
mation may actually be of more immediate benefit iT> system 
development. 
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W =^ X / Y - Z , . 

where: segments* 

> becomes • . 

. . / under the environment 

position occupied by the segment 

UNDER QUESTION (l .E.. X) 

Note: X = 0 . deletion 

W = 0 ir^'SERTION, ■ ' 

W ?^ X ?^ 0 SUBSTITUTION 



Fig ure 1 * 

Format of context-sensitive phonological rules 
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Sample output of applying rule R102 to discourse lexicon 
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FIGURE 3 

Sample summary statistics of applying rule Rr02 to a* 

DISCOURSE LgXICON,. " ^ 
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Data Base: - IS Speakers 
27.239 Tokens . 
^ 3.519 Types 
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Deletion Rule: 
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Number of realizatio^ns (tokens) compared with NiUMBER of 

DIFFERENT WORDS (tYPES) TO WHICH RULE APPLIES,. 
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1146 
173 
970 
3 



REPRESENTS 4,2% TOTAL TOKENS (1146/27.239). 
AND ACCOUNTS FOR 84. £% TOTAL CASES OF DELETION (970/11.52). 
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^Occurrences of /t.d/ deletion on, "and" compared with total 
number of occurrences deletion » 0 
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